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Abstract —This paper is focused on survival analysis of elec¬ 
trical components. The main goal of this paper is to develop 
a method for on-line estimation of the Mean Time To Failure 
(MTTF) of electrical components under dynamic stress levels. 
The proposed method models the variations of the stress levels as 
a stochastic process. Hence, a stochastic failure rate function can 
be developed for each electrical component. Later, this function is 
used as the underlying rate of a doubly stochastic Poisson process 
(known as Cox processes). Fnrthermore, this Cox process is nsed 
for on-line estimation of the Mean Residual Life (MRL) using 
the observed stress levels. The proposed method provides a good 
estimate of the age and life expectancy of each component. An 
experimental case study is provided to demonstrate the proposed 
method. 

Index Terms —Reliability, Cox, doubly stochastic Poisson pro¬ 
cess, survival analysis, on-line reliability, step-noise, Weibull, 
stochastic jump, MTTF, MRL. 


I. Introduction 

M odern electrical systems are often operating near 
their margins of safe operational conditions. In order to 
reduce the cost and real estate requirements of these systems, 
traditional safety margins are reduced and these components 
are exposed to higher stress levels. Hence, dynamic monitor¬ 
ing of the operational state of these components is crucial. 
Examples of over stressed electrical systems can be found in 
modern microgrids and hybrid electric vehicles. Eor instance, 
in a modem microgrid, distributed power electronic converters 
interfacing renewable energy sources and storage system are 
under dynamic variations of the operational conditions. There¬ 
fore, thermal and electrical stress level on each component 
depends on the state of power flow within the microgrid and 
is not a fixed value. Hence, traditional off-line methods for 
reliability analysis of these systems cannot provide accurate 
models for survival analysis of these systems. 

In order to monitor the state of health and age of these 
components, reliability analysis methods are incorporated. 
Reliability deals with the ability of each component to perform 
a required action. However, this qualitative parameter is hard 
to describe in the mathematical sense. Eor this reason, this 
parameter is often described by the life expectancy of the 
component. Eor this reason, in this paper, the reliability of a 
system is analyzed by studying the survival function. Majority 
of the studies in the held of reliability can be categorized into 
two branches. Component level reliability deals with the prob¬ 
ability distribution of the time to failure of a single component 
Q-@- On the other hand, system level reliability studies 
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the reliability of a system whether it is developing a life¬ 
time probability distribution model based on the probability 
distributions of individual components, or studying the state of 
reliability (operation, safety, health, etc.) of the system using 
mathematical models @-0. The latter is often performed 
using net structures such as Markov chains and processes. Eor 
instance. Markov/hidden Markov models have been widely 
used to estimate the state (of health) of a system 0 ^ 0 - 
Another approach is to optimize the system for maximizing 
the reliability G0-1I3- 

Conventional reliability analysis methods are mainly fo¬ 
cused on exponential probability distributions p3)-p3). This 
is due to the simplicity in working with the exponential 
probability distribution. In particular, the Markovian (mem¬ 
oryless) property of this distribution enables for derivation 
of closed form time domain models for the net (graph) 
structures developed with this distribution. Markov processes 
is an example of applications of exponential distribution in 
a net structure. The base exponential model is calculated 
based on empirical measurements on accelerated life tests on 
each component (or component family) Q, 0. The results 
from these measurements lead to failure rate functions that 
can be used for survival analysis GD, |T7)- In order to 
improve the accuracy of the survival model, more complex 
probability distribution functions can be incorporated ID- 
Eor instance, Weibull distribution is of a great importance in 
reliability analysis. Distribution of various empirical data from 
accelerated life tests can be modeled using Weibull distribution 
function Eor this reason, Weibull is often incorporated to 
increase the accuracy of the reliability model 119|. 

On the other hand, in conventional methods, after a distri¬ 
bution function is selected (whether it is a simple exponential 
model or a more real-life-like model), it is tuned based on 
the expected operational conditions during the life span of 
the component. In order to be on the safe side, the hazard 
rates of these models are tuned for the worst case scenarios. 
However, the electrical components under study will not 
constantly suffer from the maximum stress levels. Consider a 
battery storage system in a microgrid. Depending on the power 
demand within the grid or seasonal conditions, the thermal and 
electrical stresses over the battery charger will vary. Hence, an 
on-line method for monitoring the stress levels and aging of 
the battery charger is of interest. 

This paper presents an on-line method for calculating the 
age and life expectancy of electrical components. Eirst, a 
stochastic model for modeling the expected stress levels based 
on the operational conditions is developed. This model is used 
as the stochastic acceleration factor in modeling the survival 



functions of each component. This is performed by introducing 
Cox processes as the distribution function of the time to 
failures. The developed process is used for estimating the 
mean residual life of the component based on each observed 
stress level. Finally, an experimental study is provided to 
demonstrate the proposed method. 



II. Survival Models 

In the probability theory, stochastic processes which deal with 
random collections of points are called point processes. In 
particular, this paper is incorporating point processes to study 
random occurrence of incident. In fact, these incidents are 
failures of electrical components. Renewal processes have 
been widely used to study the reliability of electrical system. 
In this paper, a renewal process with a stochastic rate of the 
process is studied. First, some introductions are required. 

A. Renewal Processes 

Let n be a sample space and let F = 55(17) be the cr-algebra 
(Borel cr-algebra). Let P : —)• [0,1] be a measure on F. The 
quartet (17, {-Ftlooi P) is a filtered probability space with 
a natural filtration of {Ft}t^i, where I G K+ (practically, I G 
N) is a totally ordered index set. Although the filtration might 
not be of great importance for the analysis of a Homogeneous 
Poisson Process (HPP), it is required for the later sections. 
The history is defined as 

F 30 .- ^ T (1) 

Fr = {AG F\An{T <t)G Ft,yt > 0} (2) 

On the other hand, let X be a topological space with its Borel 
cr-algebra ®(X) to define random variables a; € X as 

x: (f7,J-)^ (X,$(X)) (3) 

Consider an orderly counting process A/^(t)t>o which is 
adapted to the filtration Ft- In this process, Af{t) represents 
the number of incidents during the time interval (0,7] while 
A/^(0) = 0. For now, it is assumed that the process has 
independent deterministic increments. Hence, the infinitesimal 
probabilities of the process are defined as 

Pr{M{t + S)- M{t) = l\Ft} = w{t)S + o{S) (4) 

Pr{M{t + S)-M{t)>l\Ft} = o{S) (5) 

where w{t) is the renewal density (in probability theory) or the 
rate of occurrence of failure (ROCOF) (in reliability theory). 
This function defines the rate of the process. Before studying 
the process any further, let T : 17 —> K+ denote a random 
variable representing the interval of the time till the first 
incident in the process (i.e. (0,Ti]). The survival function of 
this incident is defined as 

St, (7) = 1 - Ft, (7) = Pr{ri > 7} (6) 

This function represents the probability of the incident 
throughout the time. Hence, the probability of having a failure 
at time (7,7 + St] while the unit has survived till 7 is derived 
using 

A(7) = Pr {7 < Ti < 7 + 5|7 < Ti}/S\s^o = (7) 


Fig. 1. The bathtub failure rate function. 


where Jt, (7) = Ft, (7) is the probability density function. A(7) 
is defined as the failure rate function or the Force Of Mortality 
(FOM). One should avoid interchanging the ROCOF and the 
failure rate function since the failure rate function represents 
the hazard rate of a single incident whereas ROCOF defines the 
rate of incidents in a process. Further discussion is available 
in 1201 . 


B. Improvements to the Failure Rate Function 

Conventionally, the majority of studies on reliability as¬ 
sessment in electrical systems consider a constant failure rate 
function for each component. This failure rate function is 
known as the failure rate for short. Using a fixed A(7) = A, 
the survival function is derived using 

S{t) = e- -fo ^ g-At 


Memory-less property of exponential distributions results 
in simplicity of analytical studies. However, it has been 
observed that a constant failure rate function cannot represent 
the measured hazard rates in a practical application. Many 
studies suggest using a bathtub curve to include the infant 
mortality and wear-out of individual components. For instance. 
Fig. 0 illustrates a recommended hazard rate for modeling 
electrical components by the American telecommunication 
provider, AT&T This curve does not consider the wear-out 
period. However, the infant mortality is included using Weibull 
distributions. This hazard rate models the infant mortality for 
nearly a year (10 thousand hours). Afterwards, it is reduced 
into a constant failure rate model (or a Weibull with a ~ 0+ 
for more accurate results). The survival function of such failure 
rate model is calculated as 


S(t) = 


^b'^0 i 


e 


t < To 
To < 7 


(9) 


where Tq G K"*" is the base transition time (10^ hours ^ 1 
year). Several studies have developed tables for components 
commonly used in electrical systems 0,(2). a G [0,1) is the 
empirically measured shape parameter of the Weibull distri¬ 
bution. The data regarding the shape parameters are available 
based on accelerated life test studies. One can observe that (91 
is not a standard (a„j,Aw) Weibull distribution. However, (91 
is generated with the change of variables of a^, = 1 — a and 
= (A6To“/(l-a))i/(i-“). 

The base failure rate, Af, G K"*", in (|^ is calculated based on 
results from empirical measurements. It should be noted that 
the results from different measurements do not necessarily lead 
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to same failure rates. This is due to the components under test 
as well as type of the accelerated testing. Let the component 
under study be an npn Bipolar Junction Transistor (BIT). In 
order to simplify the study, a 45V 0.5A switch with a transition 
frequency (i.e. fx) of 100 MHz is selected. It should be 
noted that the presented method is similar for every electrical 
components and the selected transistor is to provide a case 
study for clarifying the method. For Weibull distribution, the 
operational failure rate function of this switch can be estimated 
using 

X = Aaia)T^Xb ( 10 ) 

where Xt is the base failure rate with a value of 20 x 10“® 
failures per hour for the selected BJT. a is 0.6 for the selected 
switch. Aa{a) G [l,oo) is the stress related aging factor 
(acceleration factor) which is calculated using 

A„(a) = (Ar4le4l,„)(i-“) (11) 

where Aen = 1 is the environmental factor. Ag is the electrical 
factor and depends on the type of the component. For the case 
of a BJT, this parameter can be calculated using 

g ^’CE 

Ae = max{l, 0.47e '“ce } (12) 

where the breakdown voltage of the switch is = 45 
while the operational voltage is vce- Furthermore, the thermal 
coefficient At can be derived using 

At = T- To (13) 

where Ea is the activation energy of 0.4 eV. Kt is the 
Boltzmann constant. is the reference temperature of 313 K 
while To is the ambient operational temperature. Hence, the 
failure rate function for this case study can be approximated 
by linearization near the nominal operation point as 



Fig. 2. A sample path of the failure rate function As(f). 

Hence, the rate of this point process itself is a stochastic 
process. Cox processes have been widely used in economic 
studies ||2^-||28l. 

C. Step Noise-Cox Process 

It was mentioned that the aging factors governing the gain of 
a failure rate function have stochastic variations. The failure 
rate function of a BJT was developed in ( [T4| . It can be 
observed that both inputs of temperature and voltage have a 
same effect on the overall failure rate function. If variations 
of multiple parameters is of interest, due to linear nature of 
( [l4| , the random variables are summed and hence, individual 
characteristic functions can be multiplied. For this reason, 
this section studies the stochastic variations of one parameter 
(voltage). Variations of the voltage of the switch can be 
modeled using a simpler stochastic process. Let 

Mt 

VCE = Vt = Vo Vk{u{t - Sk) - u{t -Sk- r)) (15) 

k=l 


X ~ (14) 

r A6ro“8.4^-“(l + (l-a)(0.06u + 0.04fa))f-“ t<T 
{ A68.4(1 + (0.06F + 0.04fa)) T<t 

near the operation condition of u = 30 V and Tg = 333 K. 
It should be noted that under accelerated conditions, Weibull 
distribution will transition into an exponential distribution at 
time T < Tq (this transition occurs sooner than the time Tq 
dehned by failure rate tables such as AT&T handbook). There¬ 
fore, T can be calculated by finding the point where Weibull 
and exponential failure functions meet. In a conventional 
survival analysis, the failure rate function is calculated based 
on the nominal operation conditions. However, in a practical 
application, the aging factors (i.e. v and To) vary with time. 
For instance, seasonal stochastic variations of temperature can 
lead to accelerated aging of a component. Hence, to achieve a 
more accurate failure rate function, on-line calculation of the 
acceleration factor is of interest. Moreover, this suggests that 
the acceleration factor has a stochastic nature. One solution 
is to incorporate doubly stochastic Poisson processes known 
as Cox processes. Cox processes were hrst introduced by 
D. R. Cox in observation of patterns in empirical data HD, 
p2|. This process benehts from random failure rate functions. 


be the stochastic process estimating the variations of the oper¬ 
ational voltage. In this process, random variable z/fc : Hi. —>^ K 
represents a random voltage step on the voltage of the switch 
while vq = 0. The above process is similar to a compound 
Poisson. This paper refers to Poisson processes with the above 
rate function as step-noise cox processes. Here, {A4t}t>o is 
a homogeneous Poisson process with intensity p. t G K"*" is 
the length of each step noise in time. is the time until the 
fc-th arrival and is calculated using 

Mt 

Sfe=^Tfc (16) 

fc=i 


where T^ : H —IR+ is the /c-th inter-occurrence time. The 
stochastic process dehned in (151 models the random behavior 
of the voltage of the switch, therefore, the stochastic variations 
of the accelerated failure rate function can be modeled using 
•HI and S combined as 


Mt 


X 


= - Sfe - T))t 


(17) 


k=l 


It should be noted that Aq is the starting value of the A while 
K represents the gain of the perturbations with respect to ([l4|. 
































Also, in the previous example, Aq = 165 x 10“® and k = 
165 X 0.066 X 10“®. This function can be derived for the case 
of exponential distribution by setting a = 0. Additionally, due 
to the characteristics of the compound Poisson process in ( [T7| , 
replacing u{t — Sk) with 1 does not change the function. 

A sample path for the stochastic behavior of the exponential 
failure rate function is illustrated in Fig. In this figure, the 
random steps of the voltage, i/^, have a uniform distribution 
on [—5,5]. It can be deduced from this simple example that 
variations of the voltage of the selected BJT can change 
the average number of failures from 60 to 280 incidents per 
million hours of operation. The reason for proposing a step- 
noise process in this paper is the behavior of power electronic 
converters. Consider a source with a series impedance. Due to 
the step changes in the duty cycle of the converters, the voltage 
drop over the series impedance of each source is in the form of 
a step. These voltage steps are highly observable in utilization 
of a stair-case method for MPP tracking of photovoltaic solar 
panels. 

Due to the stochastic variations of the failure rate function. 


f X{s)ds < oo, X{t) > 0 > 0 

Jo 


(18) 

And for 0 < s < t and Vm € K while having the 
full realization of X{t) as the conditional characteristic 
function of the resulting Poisson process is defined as 




where — Sk) — {t — T)^~°‘u{t — Sk — t))/(1 — a) 

is approximated as tS^“ since r ^ f. In order to derive 
the characteristic function of the Cox process, extension of 
Fokker-Planck-Kolmogorov for jump-diffusion based on Ito 
calculus is incorporated. This extension which is useful for 
jump processes was originally developed for piecewise deter¬ 
ministic Markov processes @-131). Using this concept, the 
infinitesimal generator of the process A 2 can be calculated for 
the characteristic function (Ai is derived in the exact same 
way, however, since A 2 is more complex, we demonstrate 
derivation of this part and we leave the first part to the reader) 


/(A2,f) = E[e-^“^^|A2j 

using 

— at + aAo at 


p[ /(A2 + KTC,i)dG(C)-/(A2,f)] 

Jv{G) 


(23) 


(24) 


(17 1 is denoted as A(f). The process is assumed to be operating 
on the same probability space of (U, {/"tltxO) P) which 
was defined earlier and is measurable. The process X{t) > 0 
is adapted to J'/' (and in fact is predictable). The counting 
process generated by this intensity is called a doubly stochastic 
Poisson process or Cox process. The necessary condition for 
validity of the following studies is to eliminate any explosions 
in the process. Hence, let 


since dA 2 /dt = —at ^A 2 , with a change of variable of 
z = A 2 f“, the partial derivative of f{z,t) with respect to 

df ( dz dA2 I dz\ _ 


has the terms + f) = of 


(19) 

which is quite well-known for Poisson processes and therefore, 
the probability of having k failures can be calculated using 

- /I 

-la. - (20) 

where denotes the realization of the stochastic process 
generating the failure rate function. In order to benefit from 
this process in a practical application, the dependency to the 
future has to be eliminated (i.e. Also note that 

A(f) = f X{u)du (21) 

Jo 

pt Mt 

= / t~°‘{Xo + n'^i^kil-u{t - Sk - T)))dt 

•^0 fc=i 

is the aggregated process. At, which affects the characteristic 
function of the Cox process. Additionally, the processes in 
this paper are optional cMlag processes. Af can be integrated 
directly as a new compound Poisson process 

Mt 

A{t) = Aot^“^/(l - a) + AC ^ = Ai + A 2 (22) 


aA 2 f“ = 0 and therefore, this change of variable is very 
useful by transforming the infinitesimal generator into 

Af{z, t) = %+p[J f{z + nrCe, .)dG(C) - /(.,.)] (25) 

and hence, df is calculated as 

dfizA)=Af{z,t) (26) 

+ [J f{z + KTCt°‘,t)dG{C) - f{z, t)]dNt 

where dNt is the compensated jump process (i.e. dN — Xdt). 
Therefore, based on Watanabes characterization of a Poisson 
process, this term is a martingale. G{x) is the probability 
kernel and in general is a function of time as well (i.e. G{x, t)). 
In the special case studied in this paper, G(.) represents a 
probability distribution of random voltage steps 

It is very important to note that the infinitesimal semi-group 
is being used for a non-Markovian process. Therefore, care 
should be taken to include the calendar time in the derived 
equation and not the time since the last observation t — s. 
Therefore, by t, all of the above and following equations 
refer to the calendar time t starting from to = 0 which is 
the first time the device was installed. In this application, 
the transition semi-group T* ^ should be used and not the 
Markovian transition semi-group Tt. Moreover, At is written 
as A with an abuse of notation. 

With respect to (|26), a reference function in the form of 


(27) 




is selected. Moreover, it should satisfy the boundary condition 
and should be multiplied with the characteristic function of 
A 2 . Hence, 


E[e-^“^‘|Ao] = “Ao/(l-a) 

JoP{j 


( 28 ) 


-JUlCKTk 


X e 


dG(C)-l)dk 




Now, we can observe the characteristic function of the Cox 
process. First, it was mentioned that due to the conditional on 
the future, the characteristic function of the Cox process can¬ 
not be directly calculated. Hence, the expected characteristic 
function can be derived using 

^ jtA] ^ ( 29 ) 

which can be combined with ( [28| ) (using a change of variable 
of —ju = — 1) and assuming s = 0) to derive 

E[e-^“^‘|Ao] = (30) 


for Ao = 0 and the derivation of the characteristic function of 
the proposed step noise-Cox process is hnished. 

Following relations are observed based on (28 1 

E[At|Ao] = (Ao -f pTKo^GM)t~°‘ (31) 

E[At|Ao] = (Ao -f pTKoEGM)f^““/(l - a) (32) 

It is very important to note that the compound Poisson process 
in A is windowed. Otherwise, the expected value of the process 
would be E[At|Ao] = Aof““ -f E[At|Ao] = 

Aof^““(l—a)-|-pKoEG[H^^~'^/(2~CA)- However, in this paper, 
each new step noise will survive only for a duration of r. 
Based on (30i, ROCOF w{t) — E[A/t|fo = 0, Aq] is calculated 
by getting the derivative of the characteristic function at u —> 
0. Therefore, 

^1 —O' 

E[A/'t|fo, Ao] = Ao-e;-^ + pKT 


Ck-°‘dG{C)dk (33) 


(l-«) 

and Pr{A/) = 0} = S{t) = Pr{T/ > t}, hence 
Pr{T > f} = 

for A/q = 0. By suppressing G (i.e. letting fciy) = 5(y — Q)), 
the model simplihes to Weibull as 

V(l-a) 


(34) 


Pr{T >t} = e~^°* 


(35) 


and by letting a \ 0, exponential distribution for modeling 
reliability of electrical components is derived. These two mod¬ 
els are traditionally used for reliability analysis of electrical 
components. Hence, the proposed model is more detailed and 
can fully contain the conventional models. 

In this section, the stochastic failure model of an electrical 
component based on a step noise-Cox process was developed. 
Next section will use this model for on-line survival analysis 
of a component. 


this paper is proposing On-line or Real-time survival analysis 
by updating the failure rate functions timely with respect to the 
observed operational conditions. For this purpose, the concept 
of residual life is used 

Rl{a;|s} = Pr{r>a;-fs|r>s} (36) 

where the inter-occurrence time, T, is the time to the hrst 
failure (more accurately, T is the time to a failure with respect 
to the preceding failure), s > 0 is the time that the process 
was observed. Therefore, s is the moments that the failure 
rate function is recalculated based on the new operational 
conditions, a; > 0 is the time in the future. Therefore, if we 
dehne a global time (calendar time) t, then at any moment after 
s,t = s-\-x. Moreover, it was assumed that the variations of the 
failure rate function has a stochastic nature. On the other hand, 
at an observation time, s, the failure rate function prior to s has 
been observed. Hence, the measured hazard rate values have 
a deterministic nature with respect to hltration A),. Therefore, 
the residual life of the process is 

Rl{a;|s} = g-((^+«)'-“-^'““)A,/(i-a) ( 37 ) 

X ^-PKst({x+s)'^~‘^Eg[v]-s^~‘^U)/{ l-a) 


where M is the mean value of the observed fluctuations on 
the failure rate function. In many applications, it is assumed 
that M is zero. However, claim is that in many applications, 
this value is not zero since the assumptions on stress factors 
do not necessarily occur and stresses can be higher or lower 
than the assumed values. In order to expand this definition for 
the bathtub curve of Fig. [T] the residual life can be calculated 
using 


Rl{a;|s} = lx+s<Tte~ 

X e“ 

+ ^x+s>Tt,s<Tt^ 

X e“ 

X g-(®+*-^t)A° ^ g-pK°r(x-|-s-Tt)EG[!^] 
+ ls>T X 


(38) 


where A° is the modified failure rate function at the moment of 
transition. As is the modified failure rate function based on the 
operational conditions, Ks and are the modified gains of the 
random small signal variations of the voltage and temperature, 
respectively. These parameters for the case study of the BJT 
can be calculated using 


HI. On-line Survival Analysis 

In conventional methods, the failure rate functions for a 
component are calculated using the expected information from 
the operational conditions. However, in a real life application, 
these expectations do not necessarily occur. For example, 
consider a power converter deployed to a location. This 
location has an average summer temperature of 100° F while 
the average winter temperature is 30° F. It can be observed that 
the change in these operational conditions are not included in 
the off-line failure rate function calculations. For this reason. 


^'^CE 1 0.4ey , 1 1 

Aa{n) =0.47e'"iB k, ^313 t;> 

(39) 

A, = A,Ai-“(n)To“ 

(40) 

t^SV ~ -^8(1 ~ ^ 

(41) 

KsT - 

(42) 

A° = XbAain) 

(43) 

0 _ \0/ 3 '1 

(44) 

.,0 _ \0r O.ieV ^ 

^ST Kb{T‘)'^ ' 

(45) 
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Fig. 3. A 5 year sample path of the failure rate function As. 


where Aa is the base acceleration factor calculated at the 
observation time s (based on the observed operational con¬ 
ditions). Indexes v and T determine the k for variations of 
voltages and temperatures, respectively. Tt is the transition 
time. This value represents the remaining time to pass the 
infant mortality band. For the n-th observation period, this 
value can be calculated using 


n) (y TAge) 

n—1 

(46) 


(47) 


where Tsmp is the time period between each on-line residual 
life calculation. Tq is the base infant mortality period which 
was described in the previous section (i.e. 10^ hours). T^ge 
represents the age of the component which is affected by the 
acceleration factors. Therefore, a component can be older than 
the elapsed calendar time. Aa{k) is the aging acceleration 
factor estimated at the fc-th on-line estimation period and was 
described in (11 1 . Aa{n) is calculated using (111 based on the 
average voltage and temperature data measured since the last 
estimation period. 

In order to illustrate the transition between the Weibull and 
exponential (infant mortality and constant failure rate) regions, 
a sample path for the failure rate function is illustrated in 
Fig. In this figure, the average acceleration factor is kept 
constant at 4. Therefore, the base transition time is reduced 
from 1 year (10^ hours) to 2500 hours. In addition, it can 
be observed that the average failure rate is constant and 
stochastic fluctuations with zero mean are affecting the failure 
rate function. Moreover, impacts of the acceleration factors are 
lower during the infant mortality band. 

An important note regarding ( (39l l is that in this equation, 
the variations of the voltage and temperature are included 
at the same time. However, the stochastic behavior of the 
failure rate function was derived considering the variations 
of the voltage and not the temperature. If the variations of all 
parameters occur at a same arrival time (which does not affect 
the accuracy of the results of this paper), then all of the derived 
equations remain the same. However, characteristic functions 
of the process related to temperature and the process related to 
voltage should be multiplied. This results in a survival function 


in the form of 

Pr{T >t} = ( 48 ) 


Using (381, the Mean Residual Life (MRL) of a component 
can be calculated by 


pOO 

MRL(s) = ii{s) = / Rl{a;|s}dx 

Jo 


(49) 


In general, a closed form solution to this integral is not 
available. However, numerical approximation of this integral 
can be calculated using Riemann sums at each observation 
moment. MRL of the ((38|) can be obtained using 


^0 

^}dx 


(50) 


X e 


X e 


Pk^tEg[i^\ 

-pTKs{Tl~°‘EG[v]-a 




*M)/(l-a 




' [ A^-t-p^O'^EGfi^] -I 


which only requires a Riemann sum over [0, Tt — s] for s < Tt. 
Considering the small length of this region and common base 
failure rates for electrical components, a sum with step length 
of 10 to 100 hours can provide a very accurate result for the 
MRL. M can be either neglected (assuming that the mean of 
all fluctuations in the life time of the component is zero) or 
for accurate results, can be derived by calculating the mean 
error of the stress factor over all times (i.e. M = ^ /N 
where v{k) = v{k)—v* is the observed voltage fluctuations at 
the time k and N is the total number of observations). On the 
other hand, \v\ can be calculated as a moving average over 
these fluctuations. The moving average window is a design 
parameter. One can consider a day or a week for the window 
of this moving average. This window claims that the expected 
mean of the future events based on the newly observed stress 
factors should be similar to what we have observed in the 
past day or week. One can select Eg[z/] = M assuming that 
the mean of all events in the future is the mean of all past 
events. ( [39] l can be updated in the same manner. One can use 
the new voltage and temperatures to update this parameter. On 
the other hand, to observe a smooth model, one can use the 
average of voltages and temperatures over time. Therefore, at 
a time t 
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where ^ is the cardinality of the set of all samples. 






















Fig. 4. The measured voltage of the battery. 
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Fig. 5. Mean residual life of the capacitor. 


IV. Case Study 

In this section, an experimental case study is performed using 
the proposed on-line survival analysis method. In this example, 
the survival of the input capacitor of a dc-dc converter under 
variations of the voltage of the source is studied. Consider a 
dc-dc converter that is supplied from a 120 V battery pack. 
The input capacitor of this converter is directly connected to 
the battery pack and will observe the voltage variations of 
the pack. In order to expedite the measurements, the data is 
collected every 1 minutes. However, the analysis is performed 
assuming that the data is collected at each hour. This expedites 
the measurements from 1.5 years to only 6 days. The measured 
voltage of the battery is shown in Fig. It can be observed 
that based on the state of charge of the battery, the voltage of 
the battery varies between 110 V to 130 V. 

In the conventional methods for estimation of the survival of 
the capacitor, the nominal battery voltage is used to calculate 
the distribution functions as well as MRL. However, in this 
paper, the function is calculated based on the measured voltage 
of the battery at each measurement cycle. Consider a case that 
the battery has been depleted for a long time. Conventional 
methods will consider the capacitor to be aging with a constant 
acceleration factor that corresponds to 120 V. However, the 
proposed on-line estimation method will calculate the failure 
rate based on the measured voltage of the battery. Hence, the 
model will use a smaller aging factor for this case. 


Fig. 6. On-line calculation of the capacitor aging factor. 


The on-line MRL estimation for the capacitor based on 
the measured voltage of Fig. is shown in Fig. This 
figure illustrates the difference between on-line and off-line 
MRL estimation methods. The off-line methods including a 
constant failure rate model and the bathtub model do not 
consider the variations of the voltage of the battery. However, 
the on-line method will use the data collected up to the 
measurement time to calculate the failure rate function in the 
model. Moreover, the expected failure rate function (of the 
future) is estimated using the Cox process. Therefore, the 
actual age of the capacitor is estimated more accurately. In 
order to study various applications of the proposed model, 3 
candidate models are illustrated. In all of these models, M is 
calculated as In the first model (illustrated in blue), the 
expected mean of voltage stresses in the future (i.e. E( 3 [j^]) is 
simply set to the last observed voltage. This approach is simple 
and can provide a good estimation for some applications. 
However, a more accurate approach is to calculate this mean 
as a mean of all observed voltages. This approach is illustrated 
in black. In the last model (yellow), in addition to a mean for 
calculation of Eg, this mean is also used to calculate the base 
aging factor as in (531. It should be noted that the MRL has a 
maximum limit which is based on mechanical/chemical wear- 
out. Hence, even if no voltage is applied to this capacitor, the 
MRL will no go higher than 15500hours. This is the reason 
for the clamping effect in model 1 in this figure. 


The aging factor (acceleration factor) for the capacitor is 
shown in Fig. It can be observed that the average aging 
factor is 3.7. This number is used in conventional methods. 
This number shows that the capacitor is 3.7 times older than 
the calendar time. In this paper, aging factor is updated at each 
sample. For instance. Fig. [^illustrates aging factor for models 
1 and 2 in the above discussion. Therefore, the infant mortality 
region will end in 2500 h as it can be observed from Fig. [^ 
A direct comparison between exponential, Weibull, and the 
proposed three example models, is shown in Fig. [^ In this 
example, the voltage of the battery is 120V till 1500h. Then, 
this voltage is increased to 125V and kept constant. Response 
of each model can be observed from this figure. Model 1 has 
the fastest response while model 3 has the slowest response. 
At f > 00 , the proposed models converge. 
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Fig. 7. Step change comparison for various models. 


V. Applications and Further Research 

Although this model was studied for survival analysis of elec¬ 
trical components, the same methodology can be incorporated 
to derive a distribution function for availability of energy from 
renewable energy sources. 

On the other hand, the moving average used for calculation 
of the parameters in this paper can be replaced with a more 
accurate estimator to get a better sense of the underlying 
distribution functions. 

VI. Conclusions 

A method for on-line calculation of the mean residual life 
of electrical components was introduced in this paper. The 
proposed method is based on applications of Cox processes in 
modeling the probability of the failure of components under 
stochastic variation of operational conditions. This method is 
applicable to various reliability models. In this paper, this 
method was developed for reliability analysis using a bathtub 
failure rate function. Using this method, expected residual life 
of a component was derived. Due to on-line observation of 
the stress levels, this method provides more accurate results in 
estimating the age of the system and predicting the probability 
of failures. Moreover, by tuning the stochastic failure rate 
function based on the observed stress levels, Cox processes 
can provide a good models for survival analysis of various 
components. This paper presented an experimental case study 
to demonstrate the application of the proposed method. 
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